Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling

Bensaid, Siouar; Schutz, Antony; Slock, Dirk T. M.

doi:10.1007/978-3-642-15995-4_14

Siouar Bensaid²¹,
Antony Schutz²¹ &
Dirk T. M. Slock²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6365))

Included in the following conference series:

International Conference on Latent Variable Analysis and Signal Separation

3188 Accesses
2 Citations

Abstract

Blind Source Separation (BSS) arises in a variety of fields in speech processing such as speech enhancement, speakers diarization and identification. Generally, methods for BSS consider several observations of the same recording. Single microphone analysis is the worst underdetermined case, but, it is also the more realistic one. In this article, the autoregressive structure (short term prediction) and the periodic signature (long term prediction) of voiced speech signal are modeled and a linear state space model with unknown parameters is derived. The Expectation Maximization (EM) algorithm is used to estimate these unknown parameters and therefore help source separation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

http://www.eurecom.fr/~bensaid/ICA10
Cichocki, A., Thawonmas, R.: On-line algorithm for blind signal extraction of arbitrarily distributed, but temporally correlated sources using second order statistics Neural Process. Neural Process. Lett. 12(1), 91–98 (2000)
Article MATH Google Scholar
Barros, A.K., Cichocki, A.: Extraction of specific signals with temporal structure. Neural Comput. 13(9), 1995–2003 (2001)
Article MATH Google Scholar
Tordini, F., Piazza, F.: A semi-blind approach to the separation of real world speech mixtures. In: Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN 2002, vol. 2, pp. 1293–1298 (2002)
Google Scholar
Smith, D., Lukasiak, J., Burnett, I.: Blind speech separation using a joint model of speech production. IEEE Signal Processing Letters 12(11), 784–787 (2005)
Article Google Scholar
Chu, W.C.: Speech coding algorithms-foundation and evolution of standardized coders. John Wiley and Sons, NewYork (2003)
MATH Google Scholar
Feder, M., Weinstein, E.: Parameter estimation of superimposed signals using the EM algorithm. IEEE Trans. Acoust., Speech, Signal Processing 36, 477–489 (1988)
Article MATH Google Scholar
Gannot, S., Burshtein, D., Weinstein, E.: Iterative-batch and sequential algorithms for single microphone speech enhancement. In: ICASSP 1998, pp. 1215–1218. IEEE, Los Alamitos (1998)
Google Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Society B 39, 1–38 (1977)
MathSciNet Google Scholar
Gao, W., Tsai, S., Lehnert, J.: Diversity combining for ds/ss systems with time-varying, correlated fading branches. IEEE Transactions on Communications 51(2), 284–295 (2003)
Article Google Scholar
Couvreur, C., Bresler, Y.: Decomposition of a mixture of Gaussian AR processes, Acoustics, Speech, and Signal Processing. In: 1995 International Conference on ICASSP 1995, vol. 3, pp. 1605–1608 (1995)
Google Scholar
Christensen, M., Jakobsson, A., Juang, B.H.: Multi-pitch estimation, Morgan & Claypool (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

EURECOM, 2229 route des Crêtes, B.P. 193, 06904, Sophia Antipolis Cedex, France
Siouar Bensaid, Antony Schutz & Dirk T. M. Slock

Authors

Siouar Bensaid
View author publications
You can also search for this author in PubMed Google Scholar
Antony Schutz
View author publications
You can also search for this author in PubMed Google Scholar
Dirk T. M. Slock
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Electrical Engineering, Universitè d’Evry Val d’Essone, 40 rue du Pelvoux, 91020, Courcouronnes, France
Vincent Vigneron
Laboratoire I3S, Les Algorithmes - Euclide-B, BP 121, Université de Nice-Sophia Antipolis, 2000 Route des Lucioles, 06903, Sophia Antipolis Cedex, France
Vicente Zarzoso
School of Engineering, Dept. of Telecommunications, ISITSchool of Engineering, Dept. of Telecommunications, ISITV, Université de Toulon, Avenue George Pompidou, BP 56, La Valette du Var, Cedex, 83162, France
Eric Moreau
INRIA France, Equipe-projet METISS, Centre de Recherche INRIA Rennes-Bretagne Atlantique, Campus de Beaulieu, 35042, Rennes cedex, France
Rémi Gribonval
INRIA France, Equipe-projet METISS, Centre de Recherche INRIA Rennes-Bretagne Atlantique, Campus de Beaulieu, 35042, Rennes Cedex, France
Emmanuel Vincent

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bensaid, S., Schutz, A., Slock, D.T.M. (2010). Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-15995-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15994-7
Online ISBN: 978-3-642-15995-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics